CDS
Accession Number | TCMCG075C05479 |
gbkey | CDS |
Protein Id | XP_017970879.1 |
Location | join(4439507..4439667,4439965..4440408,4440512..4440665,4441202..4441373,4442015..4442088,4442305..4442401,4442487..4442599,4442719..4442834,4443441..4443498,4444395..4444547,4444669..4444767,4444874..4445038,4445226..4445294,4446038..4446126,4446260..4446422,4446856..4446990,4447088..4447193,4447286..4447380,4447489..4447601,4447714..4447924,4448126..4448194,4448332..4448474,4448806..4448907,4449133..4449280,4449805..4449861,4449936..4450066,4450216..4450304,4450415..4450767,4450881..4450976,4452358..4452422,4453342..4453444,4453573..4453731,4453819..4453959,4454104..4454147,4454230..4454344,4454652..4454804,4454937..4455104) |
Gene | LOC18607815 |
GeneID | 18607815 |
Organism | Theobroma cacao |
Protein
Length | 1640aa |
Molecule type | protein |
Topology | linear |
Data_file_division | PLN |
dblink | BioProject:PRJNA341501 |
db_source | XM_018115390.1 |
Definition | PREDICTED: UDP-glucose:glycoprotein glucosyltransferase isoform X1 [Theobroma cacao] |
EGGNOG-MAPPER Annotation
Sequence
CDS: ATGGAGACCCGTTTTAGATCTCGGCTTTGCATTTTGATCGTTCTTGCTTGTGTAATCTTCTGTGGGTTCACTTCCGTTGGAGCTCAAAATCGAAGGCCAAAGAACGTTCAAGCTGCGATTCGGGCTAAGTGGTCGGGTACTCCATTGCTTTTGGAAGCCGGTGAATTACTTTCTAAAGAGTCGAAAAATCTTTTCTGGGAGTTCTTTGACGACTGGCTACATGTTGCAAAAACTGGTGGTGATTCTCATTCAGCTAAAGACTGCCTTAAAAAAATCTTGAAACATGGCAGCTCTCTTTTAAGTGAAACCTTGTCATCATTATTTGAATTCTCCTTAACTCTAAGATCAGCATCCCCGAGATTGGTGCTTTATCGGCAATTAGCAGAGGAGTCTCTTTCTTCTTTTCCACTGGGTGATGATAGTTACTCAAACAATGTGAACGGGCTAGATGCTAGTGAAACCTTAGAAACTATAAAGTTGGATCCCTTGCTTGTTGGTATAAATCCAAGGAGTCCTGGTGGAAAATGTTGTTGGGTGGACACTGGTGGGGCACTGTTCTTTGATGTTGCAGAACTGCTGTTGTGGCTTCAGAGACCTAATGAACTCGGTGTAGACTCCTTTCAGCAGCCTGAATTATATGATTTTGATCACATCCATTTTGATTCAAATATTATGAGCCCAGTTGCTATTCTGTACGGTGCTCTTGGAACAAATTGTTTTAAGGAGTTCCATGTTACCCTAGTTCAAGCTGCCAAAGAGGGAAAAGTTAAATATGTTGTCAGACCAGTATTACCTTCTGGTTGTGAAGCAGAAGTTGGCTTGTGTGGAGCTGTTGGCGCAAGGGATTCCTTAAACTTGGGTGGCTATGGTGTGGAGCTTGCTTTGAAGAATATGGAATATAAAGCAATAGATGACAGTACCGTAAAGAAAGGTGTAACCCTAGAAGACCCTCGGACTGAAGATCTTAGCCAAGAAGTTAGAGGGTTTATATTCTCAAAAATGCTGGAACGCAAACCTGAGCTTACTTCTGAGATAATGGCTTTTAGGGATTATCTAATGTCATCGACAATATCAGATACTCTTGATGTTTGGGAACTGAAAGATTTAGGACATCAAACTGCTCAGAGAATAGTACAAGCCTCTGATCCTTTGCAGTCAATGCAAGAAATCAGCCAAAATTTTCCGAGTGTAGTTTCTTCTTTATCTCGGATGAAGCTCAATGATTCTGTAAAAGATGAAATAATTGCAAACCAGAGAATGATCCCCCCTGGCAAGTCTTTAATGGCTCTGAATGGTGCTTTAATCAATATCGAAGATATTGACCTTTATCTGTTGATTGACTTAATCCATCGGGAGCTATCATTGGCTGATCAGTTTTCAAAATTGAAGATCCCTCAAGGCACTGTACGGAAGCTTCTATCAACTGTGACTCCTCCTGAGTCAGATATGTTTCGTGTTGATTTTCGTTCTTCCCATGTTCATTATCTCAATAACTTGGAGGAGGATGCTATGTATAGGCGATGGAGGAGTAATATAAATGATATTTTAATGCCTGTCTTCCCCGGCCAGCTACGTTATATCCGTAAGAATCTGTTCCATGCAGTTTATGTTCTCGATCCAGCAACAGTTTGTGGTCTTCAGTCCATCGATATGATTACAACTTTCTATGAGAATAGTTTCCCAATGAGATTTGGGGTGATACTGTATTCTACACAGTTCATCAAGAAGATTGAAATGAGTGGTGGTGAACTTCATTCATCTTCGTTGGAGCATGACAGTGAAATTGAGGATGATAAATCCATTTTGATTATACGACTTTTCATCTATATTAAGGAAAACCATGGGACTCAAACTGCTTTTCAATTTTTAAGCAATGTAAATCGACTACGAATTGAATCTGCTGAGTCCACTGATGATGCTCTTGAAATGCACCATATTGAAGAGGCATTTGTGGAAACAGTATTACCGAAAGCAAAATCTCCACCACAAGAAGTACTGTTGAAGCTGCAAAAGGAATCGACTTTTAAGGAACTGTCTGAAGAGAGCTCCTTGTTTGTTTTTAAGCTGGGTGTGGGCAAGCTGCAGTGTTGCCTTTTGATGAATGGTCTTGTCCTTGATTCTAGTGAGGAAGCACTTATAAATGCCATGAATGATGAACTTCCCAGAATACAAGAACAAGTTTACTATGGGCAAATAAATTCGCACACTGATGTGCTGGACAAATTCCTATCAGAAAATGGTGTCAGTCGATATAATCCGCAGATTATTGTTGATGGCAAGGCTAAACCAAGGTTTATATCTCTGGCTTCATCAATCCTTGGAGGGGAATCTGTACTGAATGACATTAATTATTTACATTCTCCTGAAACTGTGGATAATGTGAAGCCTGTAACTCATCTTCTTGCTGTTGATATCACATCAAAAAAAGGGATAAAGTTACTTCGTGAAGGAATTCGTTATCTGATTGGAGGGACCAAAGGTGCTCGTGTTGGTGTGCTATTCAGTGCTAGTCAGGATGCTAATTTACCTAGTCTTCTACTTGTGAAGACCTTTGAGATCACTGCAGCCTCATATAGTCATAAGAAGAAGGTCTTAGAGTTTTTAGATCAAGCATGCTCATTTTATGAGCATAATTACATTGTTAGATCTCCCACATCGGCTGAAAGCACTCAAGCATTCATTAACAAGGTCTATGAACTTGCTGAGGCAAATGAACTGTCATCTAAGGCATATAAATCCTCTCCACCAGAAGCTTCTGCTCAGGAGTTGAGAGAACACTTGAATAAGGTGGCCCAATTTTTATATAGACAATTTGGGATTGCATCTGGCGTTAATGCAGTTATTACCAATGGAAGGGTTACTTCTCTAGATGCTGGCGTATTTCTGAGCCATGATCTGCATCTTCTGGAGTCAGTTGAGTTCAAGCATAGAATAAAGCACATTGTACAGATTATTGAGGAAGTTAATTGGCAGGGCTTAGATCCTGACATGCTAACAAGTAAATATGTTAGCGATATTGTTATGTTCGTGTCATCTTCAATGGCTACAAGGGATCGAAGTACTGAAAGTGCCCGCTTTGAGGTTTTGAATGCACAACATAGTGCTGTTGTTCTAAATAATGAGAATTCTAGTATTCATATTGATGCGGTTGTTGATCCTTTAAGCCCATTTGGTCAGAAACTATCCTCACTTCTCCGAGTTCTGGCTAAGTATGTCCATCCAAGCATGCGCATTGTACTAAATCCCTTGAGTTCCCTTGTTGATCTTCCACTGAAGAACTATTACAGATATGTTGTCCCAACAATGGATGATTTCAGCAGTACTGATTACACAGTAAATGGACCCAAAGCATTTTTCGCAAATATGCCATTGTCCAAAACACTCACCATGAATCTGGATGTTCCTGAGCCATGGCTTGTTGAGCCCATCATTGCTGTTCATGACCTGGACAATATTTTGCTTGAAAACCTGGGTGAGACAAGAACATTACAAGCAGTTTTTGAACTTGAAGCTCTTGTTCTGACTGGTCATTGTACCGAGAAAGATCGTGACCCTCCCAGAGGTCTTCAGCTAATTCTTGGAACAAAGAATACACCTCATTTGGTTGATACTATTGTCATGGCCAATTTGGGCTATTGGCAGATGAAAGTATCACCTGGAGTTTGGTACCTACAACTTGCTCCCGGCAGAAGTTCTGAGTTATATCTTTTTAGGGATGGTGGTGACAATGGAAGTCAAGAGAAATCTTTGTCAAAGCGCATCACTATAAATGATTTGCGGGGTAAAGTAGTTCATCTAGAAGTAGTAAAGAAGAAAGGAAAAGAGCATGAAAAATTGCTTATATCAGCCGATGATGACAGCCATTCAAAAGAAAAGAGGCAGGGACATAATGGCTGGAACTCAAACTTTTTAAAATGGGCTTCTGGTTTTATTGGTGGCAGTGAGCAATCAAAAAAGAATAATGACAGTTTGGTGGAGCATGGGAAGGGTGGACGACTTGGAAAAGCAATTAACATATTTTCAATTGCTTCAGGACATTTATATGAGCGCTTCCTGAAAATTATGATTTTAAGTGTATTAAAGAATACGCGTCGTCCAGTGAAATTCTGGTTTATAAAGAACTACTTGTCTCCTCAGTTCAAGGACGTGATTCCACATATGGCACAGGAATATGGCTTTGAGTATGAACTAATTACCTACAAATGGCCTACATGGTTACATAAGCAGAAAGAAAAGCAGCGAATTATCTGGGCATATAAGATTTTGTTCCTTGATGTTATATTTCCCCTTTCATTAGAAAAGGTTATATTTGTTGATGCTGATCAAGTTGTTAGGGCGGATGTGGGAGAACTCTATGACATGGATATAAAGGGAAGACCTCTTGCATATACTCCTTTTTGTGACAACAATAAGGACATGGATGGATATCGATTTTGGAGACAAGGATTCTGGAAAGAGCATTTACGGGGTAGACCATACCATATAAGTGCATTGTACGTGGTTGACTTGGTGAAGTTTCGTGAGACTGCAGCAGGAGATAATTTGAGAGTCTTTTATGAAACTCTTAGCAAGGATCCAAACAGTCTATCCAATCTGGATCAGGATCTTCCAAACTATGCTCAGCATACAGTACCCATCTTTTCATTACCCCAAGAATGGCTATGGTGTGAATCATGGTGTGGTAATGCCACAAAATCTAGGGCAAAAACCATTGATCTTTGCAACAATCCAATGACAAAAGAACCAAAACTTAAGGGTGCTAGAAGAATAGTTTCTGAGTGGACGAATCTTGACTTTGAGGCAAGAAACTTCACTGCCAAAATATTAGGTGATGAACTGGACAACCCAGAGCCAGTAGCATCATCTGAGACCTCCTCAAATGAAAGTTCATCAGAAGATCTAGAATCTAAGGCGGAGTTGTGA |
Protein: METRFRSRLCILIVLACVIFCGFTSVGAQNRRPKNVQAAIRAKWSGTPLLLEAGELLSKESKNLFWEFFDDWLHVAKTGGDSHSAKDCLKKILKHGSSLLSETLSSLFEFSLTLRSASPRLVLYRQLAEESLSSFPLGDDSYSNNVNGLDASETLETIKLDPLLVGINPRSPGGKCCWVDTGGALFFDVAELLLWLQRPNELGVDSFQQPELYDFDHIHFDSNIMSPVAILYGALGTNCFKEFHVTLVQAAKEGKVKYVVRPVLPSGCEAEVGLCGAVGARDSLNLGGYGVELALKNMEYKAIDDSTVKKGVTLEDPRTEDLSQEVRGFIFSKMLERKPELTSEIMAFRDYLMSSTISDTLDVWELKDLGHQTAQRIVQASDPLQSMQEISQNFPSVVSSLSRMKLNDSVKDEIIANQRMIPPGKSLMALNGALINIEDIDLYLLIDLIHRELSLADQFSKLKIPQGTVRKLLSTVTPPESDMFRVDFRSSHVHYLNNLEEDAMYRRWRSNINDILMPVFPGQLRYIRKNLFHAVYVLDPATVCGLQSIDMITTFYENSFPMRFGVILYSTQFIKKIEMSGGELHSSSLEHDSEIEDDKSILIIRLFIYIKENHGTQTAFQFLSNVNRLRIESAESTDDALEMHHIEEAFVETVLPKAKSPPQEVLLKLQKESTFKELSEESSLFVFKLGVGKLQCCLLMNGLVLDSSEEALINAMNDELPRIQEQVYYGQINSHTDVLDKFLSENGVSRYNPQIIVDGKAKPRFISLASSILGGESVLNDINYLHSPETVDNVKPVTHLLAVDITSKKGIKLLREGIRYLIGGTKGARVGVLFSASQDANLPSLLLVKTFEITAASYSHKKKVLEFLDQACSFYEHNYIVRSPTSAESTQAFINKVYELAEANELSSKAYKSSPPEASAQELREHLNKVAQFLYRQFGIASGVNAVITNGRVTSLDAGVFLSHDLHLLESVEFKHRIKHIVQIIEEVNWQGLDPDMLTSKYVSDIVMFVSSSMATRDRSTESARFEVLNAQHSAVVLNNENSSIHIDAVVDPLSPFGQKLSSLLRVLAKYVHPSMRIVLNPLSSLVDLPLKNYYRYVVPTMDDFSSTDYTVNGPKAFFANMPLSKTLTMNLDVPEPWLVEPIIAVHDLDNILLENLGETRTLQAVFELEALVLTGHCTEKDRDPPRGLQLILGTKNTPHLVDTIVMANLGYWQMKVSPGVWYLQLAPGRSSELYLFRDGGDNGSQEKSLSKRITINDLRGKVVHLEVVKKKGKEHEKLLISADDDSHSKEKRQGHNGWNSNFLKWASGFIGGSEQSKKNNDSLVEHGKGGRLGKAINIFSIASGHLYERFLKIMILSVLKNTRRPVKFWFIKNYLSPQFKDVIPHMAQEYGFEYELITYKWPTWLHKQKEKQRIIWAYKILFLDVIFPLSLEKVIFVDADQVVRADVGELYDMDIKGRPLAYTPFCDNNKDMDGYRFWRQGFWKEHLRGRPYHISALYVVDLVKFRETAAGDNLRVFYETLSKDPNSLSNLDQDLPNYAQHTVPIFSLPQEWLWCESWCGNATKSRAKTIDLCNNPMTKEPKLKGARRIVSEWTNLDFEARNFTAKILGDELDNPEPVASSETSSNESSSEDLESKAEL |